Overview

Dataset statistics

Number of variables10
Number of observations43152
Missing cells0
Missing cells (%)0.0%
Duplicate rows92
Duplicate rows (%)0.2%
Total size in memory3.6 MiB
Average record size in memory88.0 B

Variable types

Numeric9
Categorical1

Alerts

Dataset has 92 (0.2%) duplicate rowsDuplicates
carat is highly overall correlated with price and 3 other fieldsHigh correlation
price is highly overall correlated with carat and 3 other fieldsHigh correlation
x is highly overall correlated with carat and 3 other fieldsHigh correlation
y is highly overall correlated with carat and 3 other fieldsHigh correlation
z is highly overall correlated with carat and 3 other fieldsHigh correlation
color has 2236 (5.2%) zerosZeros
clarity has 585 (1.4%) zerosZeros

Reproduction

Analysis started2023-12-09 13:42:28.074092
Analysis finished2023-12-09 13:42:40.225807
Duration12.15 seconds
Software versionydata-profiling vv4.6.2
Download configurationconfig.json

Variables

carat
Real number (ℝ)

HIGH CORRELATION 

Distinct268
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.79823322
Minimum0.2
Maximum5.01
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size674.2 KiB
2023-12-09T14:42:40.372095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.2
5-th percentile0.3
Q10.4
median0.7
Q31.04
95-th percentile1.7
Maximum5.01
Range4.81
Interquartile range (IQR)0.64

Descriptive statistics

Standard deviation0.47334152
Coefficient of variation (CV)0.59298649
Kurtosis1.2331988
Mean0.79823322
Median Absolute Deviation (MAD)0.32
Skewness1.1130548
Sum34445.36
Variance0.22405219
MonotonicityNot monotonic
2023-12-09T14:42:40.557312image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.3 2039
 
4.7%
1.01 1792
 
4.2%
0.31 1783
 
4.1%
0.7 1606
 
3.7%
0.32 1487
 
3.4%
1 1213
 
2.8%
0.9 1189
 
2.8%
0.41 1098
 
2.5%
0.71 1044
 
2.4%
0.4 1044
 
2.4%
Other values (258) 28857
66.9%
ValueCountFrequency (%)
0.2 10
 
< 0.1%
0.21 8
 
< 0.1%
0.22 5
 
< 0.1%
0.23 226
0.5%
0.24 200
0.5%
0.25 159
0.4%
0.26 200
0.5%
0.27 195
0.5%
0.28 158
0.4%
0.29 99
0.2%
ValueCountFrequency (%)
5.01 1
< 0.1%
4.5 1
< 0.1%
4.13 1
< 0.1%
3.67 1
< 0.1%
3.65 1
< 0.1%
3.4 1
< 0.1%
3.24 1
< 0.1%
3.22 1
< 0.1%
3.11 1
< 0.1%
3.05 1
< 0.1%

color
Real number (ℝ)

ZEROS 

Distinct7
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.4089266
Minimum0
Maximum6
Zeros2236
Zeros (%)5.2%
Negative0
Negative (%)0.0%
Memory size674.2 KiB
2023-12-09T14:42:40.698400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q12
median3
Q35
95-th percentile6
Maximum6
Range6
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.6987569
Coefficient of variation (CV)0.49832605
Kurtosis-0.86308919
Mean3.4089266
Median Absolute Deviation (MAD)1
Skewness-0.19045573
Sum147102
Variance2.8857751
MonotonicityNot monotonic
2023-12-09T14:42:40.826005image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
3 9015
20.9%
5 7863
18.2%
4 7644
17.7%
2 6707
15.5%
6 5413
12.5%
1 4274
9.9%
0 2236
 
5.2%
ValueCountFrequency (%)
0 2236
 
5.2%
1 4274
9.9%
2 6707
15.5%
3 9015
20.9%
4 7644
17.7%
5 7863
18.2%
6 5413
12.5%
ValueCountFrequency (%)
6 5413
12.5%
5 7863
18.2%
4 7644
17.7%
3 9015
20.9%
2 6707
15.5%
1 4274
9.9%
0 2236
 
5.2%

clarity
Real number (ℝ)

ZEROS 

Distinct8
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.052906
Minimum0
Maximum7
Zeros585
Zeros (%)1.4%
Negative0
Negative (%)0.0%
Memory size674.2 KiB
2023-12-09T14:42:40.959616image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q12
median3
Q34
95-th percentile6
Maximum7
Range7
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.646426
Coefficient of variation (CV)0.53929796
Kurtosis-0.39456707
Mean3.052906
Median Absolute Deviation (MAD)1
Skewness0.55332182
Sum131739
Variance2.7107185
MonotonicityNot monotonic
2023-12-09T14:42:41.099557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
2 10520
24.4%
3 9793
22.7%
1 7306
16.9%
4 6530
15.1%
5 4044
 
9.4%
6 2944
 
6.8%
7 1430
 
3.3%
0 585
 
1.4%
ValueCountFrequency (%)
0 585
 
1.4%
1 7306
16.9%
2 10520
24.4%
3 9793
22.7%
4 6530
15.1%
5 4044
 
9.4%
6 2944
 
6.8%
7 1430
 
3.3%
ValueCountFrequency (%)
7 1430
 
3.3%
6 2944
 
6.8%
5 4044
 
9.4%
4 6530
15.1%
3 9793
22.7%
2 10520
24.4%
1 7306
16.9%
0 585
 
1.4%

depth
Real number (ℝ)

Distinct177
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean61.743046
Minimum43
Maximum79
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size674.2 KiB
2023-12-09T14:42:41.266099image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum43
5-th percentile59.3
Q161
median61.8
Q362.5
95-th percentile63.8
Maximum79
Range36
Interquartile range (IQR)1.5

Descriptive statistics

Standard deviation1.4282431
Coefficient of variation (CV)0.023132048
Kurtosis5.280629
Mean61.743046
Median Absolute Deviation (MAD)0.7
Skewness-0.087590167
Sum2664335.9
Variance2.0398784
MonotonicityNot monotonic
2023-12-09T14:42:41.609712image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
62 1787
 
4.1%
61.9 1739
 
4.0%
61.8 1670
 
3.9%
62.2 1648
 
3.8%
62.1 1613
 
3.7%
61.6 1549
 
3.6%
61.7 1536
 
3.6%
62.3 1518
 
3.5%
62.4 1411
 
3.3%
61.5 1355
 
3.1%
Other values (167) 27326
63.3%
ValueCountFrequency (%)
43 1
< 0.1%
44 1
< 0.1%
51 1
< 0.1%
52.2 1
< 0.1%
52.3 1
< 0.1%
52.7 1
< 0.1%
53 1
< 0.1%
53.1 1
< 0.1%
53.2 2
< 0.1%
53.4 1
< 0.1%
ValueCountFrequency (%)
79 1
< 0.1%
78.2 1
< 0.1%
73.6 1
< 0.1%
72.9 1
< 0.1%
72.2 1
< 0.1%
71.8 1
< 0.1%
71.6 1
< 0.1%
71.3 1
< 0.1%
70.8 2
< 0.1%
70.6 2
< 0.1%

table
Real number (ℝ)

Distinct125
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean57.459548
Minimum43
Maximum95
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size674.2 KiB
2023-12-09T14:42:41.797256image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum43
5-th percentile54
Q156
median57
Q359
95-th percentile61
Maximum95
Range52
Interquartile range (IQR)3

Descriptive statistics

Standard deviation2.2293248
Coefficient of variation (CV)0.038798162
Kurtosis3.1564265
Mean57.459548
Median Absolute Deviation (MAD)1
Skewness0.80595361
Sum2479494.4
Variance4.9698891
MonotonicityNot monotonic
2023-12-09T14:42:41.973454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
56 7942
18.4%
57 7771
18.0%
58 6647
15.4%
59 5298
12.3%
55 4979
11.5%
60 3452
8.0%
54 2061
 
4.8%
61 1812
 
4.2%
62 1000
 
2.3%
63 472
 
1.1%
Other values (115) 1718
 
4.0%
ValueCountFrequency (%)
43 1
 
< 0.1%
44 1
 
< 0.1%
49 1
 
< 0.1%
50 2
 
< 0.1%
50.1 1
 
< 0.1%
51 8
 
< 0.1%
51.6 1
 
< 0.1%
52 40
 
0.1%
52.8 2
 
< 0.1%
53 442
1.0%
ValueCountFrequency (%)
95 1
 
< 0.1%
79 1
 
< 0.1%
76 1
 
< 0.1%
73 2
 
< 0.1%
71 1
 
< 0.1%
70 5
 
< 0.1%
69 7
 
< 0.1%
68 18
 
< 0.1%
67 32
0.1%
66 71
0.2%

price
Real number (ℝ)

HIGH CORRELATION 

Distinct10694
Distinct (%)24.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3939.4907
Minimum326
Maximum18818
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size674.2 KiB
2023-12-09T14:42:42.149013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum326
5-th percentile544
Q1956
median2401
Q35354.25
95-th percentile13105.35
Maximum18818
Range18492
Interquartile range (IQR)4398.25

Descriptive statistics

Standard deviation3990.001
Coefficient of variation (CV)1.0128215
Kurtosis2.1498373
Mean3939.4907
Median Absolute Deviation (MAD)1671
Skewness1.6112291
Sum1.699969 × 108
Variance15920108
MonotonicityNot monotonic
2023-12-09T14:42:42.333828image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
605 109
 
0.3%
698 101
 
0.2%
789 101
 
0.2%
625 99
 
0.2%
828 99
 
0.2%
544 98
 
0.2%
776 92
 
0.2%
802 92
 
0.2%
552 92
 
0.2%
720 92
 
0.2%
Other values (10684) 42177
97.7%
ValueCountFrequency (%)
326 2
< 0.1%
327 1
< 0.1%
334 1
< 0.1%
336 1
< 0.1%
337 1
< 0.1%
338 1
< 0.1%
339 1
< 0.1%
340 1
< 0.1%
342 1
< 0.1%
344 1
< 0.1%
ValueCountFrequency (%)
18818 1
< 0.1%
18806 1
< 0.1%
18804 1
< 0.1%
18797 1
< 0.1%
18795 2
< 0.1%
18791 1
< 0.1%
18788 1
< 0.1%
18787 1
< 0.1%
18781 1
< 0.1%
18780 1
< 0.1%

x
Real number (ℝ)

HIGH CORRELATION 

Distinct544
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.7326071
Minimum0
Maximum10.74
Zeros7
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size674.2 KiB
2023-12-09T14:42:42.525630image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile4.29
Q14.72
median5.7
Q36.54
95-th percentile7.65
Maximum10.74
Range10.74
Interquartile range (IQR)1.82

Descriptive statistics

Standard deviation1.1201963
Coefficient of variation (CV)0.19540783
Kurtosis-0.61092113
Mean5.7326071
Median Absolute Deviation (MAD)0.92
Skewness0.37530417
Sum247373.46
Variance1.2548397
MonotonicityNot monotonic
2023-12-09T14:42:42.717968image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4.38 359
 
0.8%
4.37 350
 
0.8%
4.34 339
 
0.8%
4.39 324
 
0.8%
4.33 324
 
0.8%
4.32 320
 
0.7%
4.35 313
 
0.7%
4.31 312
 
0.7%
4.41 308
 
0.7%
4.36 301
 
0.7%
Other values (534) 39902
92.5%
ValueCountFrequency (%)
0 7
< 0.1%
3.73 2
 
< 0.1%
3.74 1
 
< 0.1%
3.76 1
 
< 0.1%
3.77 1
 
< 0.1%
3.79 2
 
< 0.1%
3.81 3
< 0.1%
3.82 2
 
< 0.1%
3.83 3
< 0.1%
3.84 3
< 0.1%
ValueCountFrequency (%)
10.74 1
 
< 0.1%
10.23 1
 
< 0.1%
10 1
 
< 0.1%
9.86 1
 
< 0.1%
9.54 1
 
< 0.1%
9.53 1
 
< 0.1%
9.51 1
 
< 0.1%
9.49 1
 
< 0.1%
9.44 3
< 0.1%
9.42 2
< 0.1%

y
Real number (ℝ)

HIGH CORRELATION 

Distinct543
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.736434
Minimum0
Maximum58.9
Zeros6
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size674.2 KiB
2023-12-09T14:42:42.908185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile4.3
Q14.73
median5.71
Q36.54
95-th percentile7.64
Maximum58.9
Range58.9
Interquartile range (IQR)1.81

Descriptive statistics

Standard deviation1.1474997
Coefficient of variation (CV)0.20003712
Kurtosis112.0351
Mean5.736434
Median Absolute Deviation (MAD)0.92
Skewness2.9098111
Sum247538.6
Variance1.3167556
MonotonicityNot monotonic
2023-12-09T14:42:43.092322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4.34 348
 
0.8%
4.37 348
 
0.8%
4.35 331
 
0.8%
4.38 329
 
0.8%
4.33 326
 
0.8%
4.32 323
 
0.7%
4.39 321
 
0.7%
4.4 321
 
0.7%
4.41 308
 
0.7%
4.36 301
 
0.7%
Other values (533) 39896
92.5%
ValueCountFrequency (%)
0 6
< 0.1%
3.68 1
 
< 0.1%
3.71 2
 
< 0.1%
3.72 1
 
< 0.1%
3.73 1
 
< 0.1%
3.75 1
 
< 0.1%
3.77 2
 
< 0.1%
3.78 5
< 0.1%
3.82 1
 
< 0.1%
3.83 1
 
< 0.1%
ValueCountFrequency (%)
58.9 1
< 0.1%
31.8 1
< 0.1%
10.54 1
< 0.1%
10.16 1
< 0.1%
9.85 1
< 0.1%
9.81 1
< 0.1%
9.48 1
< 0.1%
9.46 1
< 0.1%
9.42 1
< 0.1%
9.4 1
< 0.1%

z
Real number (ℝ)

HIGH CORRELATION 

Distinct366
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.5392559
Minimum0
Maximum31.8
Zeros19
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size674.2 KiB
2023-12-09T14:42:43.271487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile2.65
Q12.91
median3.53
Q34.04
95-th percentile4.73
Maximum31.8
Range31.8
Interquartile range (IQR)1.13

Descriptive statistics

Standard deviation0.70806215
Coefficient of variation (CV)0.20005961
Kurtosis58.221996
Mean3.5392559
Median Absolute Deviation (MAD)0.57
Skewness1.7929706
Sum152725.97
Variance0.50135201
MonotonicityNot monotonic
2023-12-09T14:42:43.456609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2.7 599
 
1.4%
2.69 596
 
1.4%
2.71 586
 
1.4%
2.68 579
 
1.3%
2.72 575
 
1.3%
2.67 515
 
1.2%
2.73 487
 
1.1%
2.74 438
 
1.0%
2.66 427
 
1.0%
4.02 421
 
1.0%
Other values (356) 37929
87.9%
ValueCountFrequency (%)
0 19
< 0.1%
1.07 1
 
< 0.1%
1.53 1
 
< 0.1%
2.06 1
 
< 0.1%
2.24 1
 
< 0.1%
2.25 1
 
< 0.1%
2.26 1
 
< 0.1%
2.27 1
 
< 0.1%
2.29 1
 
< 0.1%
2.3 1
 
< 0.1%
ValueCountFrequency (%)
31.8 1
< 0.1%
8.06 1
< 0.1%
6.98 1
< 0.1%
6.72 1
< 0.1%
6.43 1
< 0.1%
6.38 1
< 0.1%
6.27 1
< 0.1%
6.13 1
< 0.1%
5.98 1
< 0.1%
5.97 1
< 0.1%

cut
Categorical

Distinct5
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size674.2 KiB
4
17259 
3
11016 
2
9700 
1
3902 
0
 
1275

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters43152
Distinct characters5
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row2
3rd row3
4th row1
5th row2

Common Values

ValueCountFrequency (%)
4 17259
40.0%
3 11016
25.5%
2 9700
22.5%
1 3902
 
9.0%
0 1275
 
3.0%

Length

2023-12-09T14:42:43.628975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-09T14:42:43.780261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
4 17259
40.0%
3 11016
25.5%
2 9700
22.5%
1 3902
 
9.0%
0 1275
 
3.0%

Most occurring characters

ValueCountFrequency (%)
4 17259
40.0%
3 11016
25.5%
2 9700
22.5%
1 3902
 
9.0%
0 1275
 
3.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 43152
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 17259
40.0%
3 11016
25.5%
2 9700
22.5%
1 3902
 
9.0%
0 1275
 
3.0%

Most occurring scripts

ValueCountFrequency (%)
Common 43152
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
4 17259
40.0%
3 11016
25.5%
2 9700
22.5%
1 3902
 
9.0%
0 1275
 
3.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 43152
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4 17259
40.0%
3 11016
25.5%
2 9700
22.5%
1 3902
 
9.0%
0 1275
 
3.0%

Interactions

2023-12-09T14:42:38.703771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:29.036178image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:30.185101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:31.345868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:32.701562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:33.936915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:35.078521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:36.397969image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:37.535604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:38.825866image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:29.172610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:30.308277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:31.472524image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:32.828089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:34.064112image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:35.203027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:36.516501image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:37.690636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:38.954395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:29.297168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:30.433370image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:31.632080image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:32.961169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:34.191648image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:35.345670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:36.651263image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:37.814237image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:39.087441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:29.423701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:30.565935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:31.766444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:33.099656image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:34.356797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:35.603986image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:36.790051image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:37.953568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:39.214555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:29.554192image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:30.700523image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:31.904553image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:33.233174image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:34.482065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:35.740188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:36.919614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:38.083427image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:39.334650image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:29.670823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:30.827525image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:32.039626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:33.354561image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:34.597198image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:35.863233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:37.037216image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:38.199720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:39.468811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:29.802384image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:30.961772image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:32.183882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:33.490602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:34.724436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:35.998691image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:37.167353image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:38.334943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:39.588839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:29.923863image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:31.087868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:32.394971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:33.635163image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:34.836830image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:36.126347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:37.284959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:38.454992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:39.714370image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:30.055573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:31.221258image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:32.549522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:33.770743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:34.958822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:36.261336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:37.407582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-09T14:42:38.577266image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-09T14:42:43.905829image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
caratclaritycolorcutdepthpricetablexyz
carat1.000-0.372-0.2490.1160.0330.9630.1920.9960.9960.993
clarity-0.3721.000-0.0300.143-0.071-0.209-0.158-0.369-0.364-0.371
color-0.249-0.0301.0000.040-0.052-0.150-0.030-0.245-0.245-0.250
cut0.1160.1430.0401.000-0.196-0.093-0.477-0.125-0.125-0.147
depth0.033-0.071-0.052-0.1961.0000.013-0.248-0.021-0.0230.106
price0.963-0.209-0.150-0.0930.0131.0000.1680.9630.9630.957
table0.192-0.158-0.030-0.477-0.2480.1681.0000.1990.1930.157
x0.996-0.369-0.245-0.125-0.0210.9630.1991.0000.9980.987
y0.996-0.364-0.245-0.125-0.0230.9630.1930.9981.0000.987
z0.993-0.371-0.250-0.1470.1060.9570.1570.9870.9871.000

Missing values

2023-12-09T14:42:39.880121image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-09T14:42:40.092524image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

caratcolorclaritydepthtablepricexyzcut
265462.014158.164.0162318.238.194.771
91591.015160.060.045406.576.493.922
141311.102362.558.057296.596.544.103
157571.505161.565.063007.217.174.421
246321.523462.157.0129687.277.324.532
498280.563659.759.021675.415.353.213
386820.304661.955.010414.324.342.684
446040.533361.856.016075.215.183.214
114591.153362.254.050086.746.654.174
24950.515661.356.031975.135.213.174
caratcolorclaritydepthtablepricexyzcut
471910.515462.356.018375.145.133.203
219621.521462.959.9100327.277.314.592
371940.461462.356.09744.964.933.084
168501.005362.759.067206.386.313.983
62650.876161.454.040126.156.203.794
112841.051362.459.049756.486.514.052
447320.476461.055.016175.035.013.064
381580.334760.358.010144.494.462.702
8600.900262.859.028716.136.033.823
157951.144260.458.063206.826.794.113

Duplicate rows

Most frequently occurring

caratcolorclaritydepthtablepricexyzcut# duplicates
560.793262.357.028985.905.853.6644
00.300463.457.03944.234.262.6912
10.300463.457.05064.264.232.6922
20.302262.257.04504.274.282.6642
30.303762.155.08634.324.352.6942
40.306262.258.07094.314.282.6732
50.312363.057.04894.324.342.7322
60.312363.057.06284.344.322.7342
70.312562.457.08024.364.322.7142
80.312661.655.06874.374.402.7042